Multiblock Discriminant Analysis for Integrative Genomic Study

نویسندگان

  • Mingon Kang
  • Dong-Chul Kim
  • Chunyu Liu
  • Jean Gao
چکیده

Human diseases are abnormal medical conditions in which multiple biological components are complicatedly involved. Nevertheless, most contributions of research have been made with a single type of genetic data such as Single Nucleotide Polymorphism (SNP) or Copy Number Variation (CNV). Furthermore, epigenetic modifications and transcriptional regulations have to be considered to fully exploit the knowledge of the complex human diseases as well as the genomic variants. We call the collection of the multiple heterogeneous data "multiblock data." In this paper, we propose a novel Multiblock Discriminant Analysis (MultiDA) method that provides a new integrative genomic model for the multiblock analysis and an efficient algorithm for discriminant analysis. The integrative genomic model is built by exploiting the representative genomic data including SNP, CNV, DNA methylation, and gene expression. The efficient algorithm for the discriminant analysis identifies discriminative factors of the multiblock data. The discriminant analysis is essential to discover biomarkers in computational biology. The performance of the proposed MultiDA was assessed by intensive simulation experiments, where the outstanding performance comparing the related methods was reported. As a target application, we applied MultiDA to human brain data of psychiatric disorders. The findings and gene regulatory network derived from the experiment are discussed.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Sequential Algorithm for Multiblock Orthogonal Projections to Latent Structures.

Methods of multiblock bilinear factorizations have increased in popularity in chemistry and biology as recent increases in the availability of information-rich spectroscopic platforms has made collecting multiple spectroscopic observations per sample a practicable possibility. Of the existing multiblock methods, consensus PCA (CPCA-W) and multiblock PLS (MB-PLS) have been shown to bear desirabl...

متن کامل

A tutorial on multiblock discriminant correspondence analysis (MUDICA): a new method for analyzing discourse data from clinical populations.

PURPOSE In communication disorders research, clinical groups are frequently described based on patterns of performance, but researchers often study only a few participants described by many quantitative and qualitative variables. These data are difficult to handle with standard inferential tools (e.g., analysis of variance or factor analysis) whose assumptions are unfit for these data. This art...

متن کامل

Matrix-Variate Discriminative Analysis, Integrative Hypothesis Testing, and Geno-Pheno A5 Analyzer

A general perspective is provided on both on hypothesis testing and discriminative analyses, by which matrix-variate discriminative analyses are proposed based on the matrix normal distribution, featured by a bi-linear extension of Fisher linear discriminant analysis and a further extension to binary variables. Moreover, a general formulation is proposed for integrative hypothesis testing and f...

متن کامل

بررسی ساختار جمعیتی گاوهای بومی ایران با استفاده از تحلیل افتراقی مؤلفه‌های اصلی

Effective management of genetic resources in the domestic animals is based on characterization of genetic structure and diversity among populations. Strategies reducing complexity and dimensions of data are required to analyze the genetic relationships between populations based on dense genomic data. The objective of this study was to use the discriminant analysis of principal components (DAPC)...

متن کامل

Batch Process Monitoring Using Multiblock Multiway Principal Component Analysis

Batch process monitoring to detect the existence and magnitude of changes that cause a deviation from the normal operation has gained considerable attention in the last decade. There are some batch processes that occur as a single step, whereas many others include multiple phases due to operational or phenomenological regimes or multiple stages where different processing units are employed. Hav...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 2015  شماره 

صفحات  -

تاریخ انتشار 2015